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Executive summary 


The rise of foundation models offers enterprises exciting new 
possibilities but also raises new and expanded questions about 
their ethical design, development, deployment and use. According 
to a recent IBM Institute for Business Value generative Al survey, 
organizations are already expressing concerns about trust-related 
issues—specifically as barriers to investment. Their top concerns 
are cybersecurity (57%), privacy (51%) and accuracy (47%). 
Many organizations were taking these concerns seriously before 
the consumerization of generative AI, expressing their intent to 
invest at least 40% more in AI ethics over the next three years. 
Awareness about risks and possible ways to mitigate them is 
the first crucial step toward building trustworthy AI systems. 


In this document we: 


ANN Explore the benefits of foundation models, including 
their capability to perform challenging tasks, potential 
nA to speed up the adoption of AI, ability to increase 
productivity and the cost benefits they provide. 


known from earlier forms of AI, known risks amplified 
© by foundation models and emerging risks intrinsic 
to the generative capabilities of foundation models. 


io Discuss the three categories of risk, including risks 
X00 


Cover the principles, pillars and governance that 
form the foundation of IBM’s AI ethics initiatives 
and suggest guardrails for risk mitigation. 


(ef) 
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Introduction 


As the use of AI continues to expand, large and complex AI models 
are delivering promising performance results, as well as solving 
some of society’s most challenging problems. However, building 
large training data sets and complex models for each AI application 
can be burdensome for enterprises. Foundation models provide a 
path to achieve the best of both worlds: build powerful state-of- 
the-art models and reuse them directly or apply tuning methods to 
implement a variety of use cases, rather than train new models for 
each use case. For example, IBM Research® developed foundation 
models for visual inspection. These foundation models learn the 
general representation of concrete surfaces and runways and 
can be further tuned for specific use cases like crack detection 
or defect inspection with less labeled data. 


IBM defines a foundation model as an AI model that can be 
adapted to a wide range of downstream tasks. Foundation 
models are typically large-scale generative models that are 
trained on unlabeled data using self-supervision. As large-scale 
models, foundation models can include billions of parameters. 


IBM is a hybrid cloud and AI company with a long reputation as 

a responsible data steward committed to Al ethics. Using the 

strength of our research, product and consulting teams, along 
with external partners, such as Hugging Face, we help bring the 

power of foundation models to our clients and build trustworthy 
Al across any enterprise. IBM also continues to invest in building 
new platforms, such as the IBM® watsonx™ AI and data platform 
and technologies, for designing and developing AI models 

to behave in an auditable and trustworthy manner. 


This document describes the point of view of IBM on the ethics 
of foundation models. It is the first version, and future versions 
will expand on various aspects of IBM’s foundation model ethics 
approach. We hope this document is helpful for all stakeholders 
in developing, deploying and using the foundation model ina 
responsible way. 
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Benefits of 
foundation models 


Foundation models can significantly improve the process of 
developing AI systems and help advance AI from the exploration 
to the adoption phase in enterprises. Their benefits include: 


Performing complex tasks 

Foundation models show a significant increase in 
performance in solving difficult and complex problems. 
For example, the geospatial foundation model from the 
IBM and NASA collaboration is designed to convert NASA’s 
satellite data into maps of natural disasters like floods 
and other landscape changes. The model could also be 
used to help reveal our planet’s past; estimate risks to crops, 
businesses or infrastructures due to severe weather; develop 
strategies to adapt to climate change; and assist agribusiness. 
The model is planned to be made available in preview to IBM 
clients through the IBM Environmental Intelligence Suite. 


As another example, IBM’s MoLFormer-XL is a foundation 
model that infers the structure of molecules from simple 
representations and makes it easy to learn various downstream 
tasks like predicting a molecule’s physical and quantum 
properties, identifying similar molecules, screening already 
approved molecules for new use cases, and discovering 
new molecules. Moderna and IBM are exploring ways to 
use MoLFormer to help predict molecule properties and 
understand the characteristics of potential MRNA medicines. 


Increased productivity 

The generative nature of foundation models expands the number 
of areas where AI can be used in an enterprise to help improve 
productivity by automating routine and tedious tasks and allowing 
users to spend more time on creative and innovative work. For 
example, IBM Watson® Code Assistant, powered by foundation 
models, enables developers of all experience levels to write code 
using Al-generated recommendations. 


Quicker time to value 

Foundation models are usually trained with unlabeled 
data, which is more accessible in larger quantities than 
labeled data. Once trained, foundation models can be 
used either directly or after being tuned for downstream 
applications, using a small amount of specialized labeled 
data, which can decrease time-to-value creation. 
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Utilize diverse data modalities 

Foundation models may be trained using various data modalities, 
such as natural language, text, image and audio. They can also 
be applied to tasks requiring different data types, such as time 
series data, geospatial data, tabular data, semi-structured data 
and mixed-modality data like text combined with images. 


Amortized expenses 

Although the initial cost of training a foundation model 

is significantly higher than training a traditional AI model, 
the incremental cost of applying it to a new task is 
considerably lower. Using pretrained foundation models 
could help eliminate the requirement that enterprises 
make substantial investments to train foundation models 
to experiment with their new capabilities. For an enterprise, 
the trustworthiness of the models, energy efficiency, 
performance, portability and the ability to use enterprise 
data effectively and securely are paramount. 


IBM allows enterprises to create 
and own the value of foundation 
models for their business by 
bringing the best innovations from 
the open, global AI community, 
running efficiently in hybrid 
computing environments, helping 
mitigate risks, and rigorously 
governing AI. 
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Risks of 
foundation models 


Like all rapidly advancing technologies, foundation models have 
risks along with benefits. Some are legal risks, for example, 
restrictions on moving or using data, and need to be carefully 
evaluated under current and evolving law. Other risks have an 
ethical nature and must be considered carefully so that the 
technology has a positive impact. In general, AI risks raise 
sociotechnical questions and should be addressed and mitigated 
through sociotechnical methods, including software tools, risk 
assessment processes, AI ethics frameworks, governance 
mechanisms, multistakeholder consultations, standards and 
regulation. We can easily demonstrate the risks by considering 
the following 3 categories: 


1. Traditional. Known risks from prior or earlier forms 
of AI systems 

2. Amplified. Known risks but now intensified because 
of intrinsic characteristics of foundation models, 
most notably their inherent generative capabilities 

3. New. Emerging risks intrinsic to foundation models 
and their inherent generative capabilities 


This information is useful, since there already are mitigation 
mechanisms that could be helpful in partially addressing the 
traditional and amplified risks. We also structure the list of risks in 
relation to whether they’re mostly associated with content provided 
to the foundation model—the input—or the content generated 

by it—the output—or if they’re related to additional challenges. 
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1. Risks associated with input 


Group Risk Indicator 
Training and tuning phase Fairness Bias like historical, representational or measurement bias Amplified 
Robustness An adversary or malicious insider injecting false, misleading, Traditional 
malicious or incorrect samples 
Value Using undesirable output, such as inaccurate or inappropriate user New 
alignment content, from downstream applications for retraining purposes 
Data laws Legal restrictions on moving or using data Traditional 
Intellectual Copyright and other IP issues with the content Amplified 
property 
Transparency The ability to disclose what content is collected, Amplified 
how it will be used and stored, and who has access 
Privacy Inclusion or presence of personal identifiable information Traditional 
and sensitive personal information 
Challenges around the ability to provide data subject rights, Amplified 
for example, opt out, right to access or right to be forgotten 
Inference phase Privacy Disclosing personal information or sensitive personal information New 
as a part of prompt sent to the model 
Intellectual Disclosing copyright information or other IP information New 
property as part of the prompt sent to the model 
Robustness Vulnerabilities to adversarial attacks like evasion, which is Amplified 
an attempt to make a model output incorrect by perturbing 
the data sent to the trained model 
Vulnerabilities to adversarial attacks like prompt injection, New 


which forces a different output to be produced; prompt leaking, 
which is the disclosure of the system prompt; or jailbreaking, 
which is avoiding guardrails established in the system prompt 
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2. Risks associated with output 


Group Risk Indicator 
Fairness Bias in the generated content New 
Performance disparity across individuals or groups Traditional 
Intellectual Copyright infringement, including compliance with open-source New 
property license agreements 
Value Hallucination—false content generation New 
alignment 
Toxic, hateful, abusive and aggressive output New 
Misuse Spread disinformation—deliberate creation Amplified 
of misleading information 
Generate toxic, hateful, abusive and aggressive content New 
Nonconsensual use of people’s likeness—deepfakes Amplified 
Dangerous use—creating plans to develop weapons or malware New 
Deceptive use of generated content—intentional nondisclosure New 
of Al-generated content 
Harmful code Execution of harmful generated code New 
generation 
Privacy Exposing personal information or sensitive personal information New 
in generated content 
Explainability Challenges in explaining why output was generated Amplified 
Traceability Challenges in determining original source and facts New 


of the generated output 
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3. Challenges 


Group Risk Indicator 
Governance Transparency Documenting data and model details, purpose, Traditional 
potential uses and harms 
Accountability | Determining responsibility for misaligned generated output Amplified 
along the Al lifecycle and value chain 
Legal compliance Intellectual Determining creator of downstream models New 
property 
Determining creator of open-source foundation models New 
Determining ownership of AI-generated content New 
Legal uncertainty about intellectual property rights related New 
to generated content 
Legal Determining downstream obligations Amplified 
uncertainty 
Societal impact Impact Human displacement—AI-induced job loss—and reskilling, which Amplified 
on jobs can particularly impact creative jobs or jobs with repetitive tasks 
Human Human exploitation—ghost work in training; inadequate Amplified 
dignity working conditions; lack of healthcare, including mental 
health; unfair compensation 
Environment Increased carbon emission due to high energy requirement Amplified 
to train and operate 
Diversity and Homogenizing of culture and thoughts New 
inclusion 
Human Misinformation and disinformation generated by foundation Amplified 
agency models, including generation of manipulative content 
Impact on Bypassing the learning process and plagiarism New 
education 
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Principles, pillars 
and governance 


IBM’s Principles for Trust and Transparency and Pillars for 
trustworthy AI are the foundation for IBM’s AI ethics initiatives. 
IBM has an AI Ethics Board with the mission to support a 
centralized governance, review and decision-making process 
for IBM Al ethics policies, practices, communications, research, 
products and services. The board includes a diverse set of 
stakeholders from across the company and is supported by 
a community of IBM employees who serve as AI focal points 
and AI ethics advocates. Through the board, IBM’s principles 
are put into practice. As new technology emerges, such 

as foundation models, the IBM AI Ethics Board is actively 
engaged in supporting alignment with these Principles 

and Pillars, which evolve to address new AI ethics issues. 
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Guardrails 
and mitigations 


IBM has established an organizational culture that supports 
the responsible development and use of AI. Based on the 
IBM Institute for Business Value AI ethics in action report, 

Al ethics has already become more business-led versus 
technology-led, and nontechnical executives are now the 
primary champions for AI ethics, increasing from 15% in 2018 
to 80% 3 years later. Additionally, 79% of CEOs are now prepared 
to act on AI ethics issues, up from 20%. We recognize that 
responsible AI is a sociotechnical area that requires a holistic 
investment in culture, processes and tools. Our investment in 
our own organizational culture includes assembling inclusive, 
multidisciplinary teams and establishing processes and 
frameworks to assess risks. 


IBM is engaging in cutting-edge research and developing 
tools to help support professionals throughout the lifecycle of 
responsible and trustworthy AI. The watsonx enterprise-ready 
Al and data platform is built with 3 components: the IBM 
watsonx.ai™ AI studio, IBM watsonx.data™ data store and IBM 
watsonx.governance™ toolkit. IBM’s AI governance technology 
enables users to drive responsible, transparent and explainable 
AI workflows. This technology includes IBM Watson OpenScale, 
which tracks and measures outcomes from AI models through 
their lifecycle and helps organizations monitor fairness, 
explainability, resiliency, alignment with business outcome and 
compliance. IBM has also developed several methods to help 
with bias issues like FairlJ, Equi-tuning, and FairReprogram. 
Read more about additional open-source trustworthy AI tools. 
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Additional guardrails and mitigations include: 


Transparency reporting 

Using standardized factsheet templates is one way 

to accurately log details of the data and model, purpose, 
and potential use and harms. 

Read more here > 


Filtering undesirable data 

Using curated, higher-quality data can help mitigate certain issues. 
IBM is developing filtering techniques to help reduce the chances 
of producing undesirable, misaligned content by removing hate 
language, biased language and profanity from the data. 

Read more here > 


Domain adaptation 

Training a foundation model to a specific domain or industry 
can help minimize the scope of risk the models can give rise 
to because it can be conditioned to generate outputs that 
are tuned to be more relevant to that domain or industry. 
Read more here > 
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Human oversight and human in the loop 

Human oversight and review can help identify and correct 

errors and biases in the generated output. Also, human BE 
validation and feedback on the quality of model responses 

help ensure that the generated content is accurate, relevant, 

of high quality, not drifting and aligned. 

Read more here > 


Consulting engagement 

IBM Consulting™ is dedicated to helping clients with the safe 
and responsible use of Al irrespective of the preferred tech 
stack. They help clients nurture a culture that adopts and scales 
AI safely, creates investigative tools to see inside black box 
algorithms and makes sure clients’ corporate strategy includes 
strong data governance principles. 

Read more here > 


IBM Enterprise Design Thinking j Va 
IBM Enterprise Design Thinking® methods and frameworks, such d ; 
as Team Essentials for AI, help clients define ethical behaviors = 
throughout the AI design and development process. 

Read more here > 


AI Ethics review 

Assessment of capabilities, limitations and risks in AI 
projects helps ensure the responsible development 
and use of the technology. 


Ethics by Design 

Ethics by Design is a structured framework with the goal of 
integrating tech ethics in the technology development pipeline, 
including, but not limited to, AI systems. Ethics by Design 
enables AI and other technologies as a force for good by 
embedding tech ethics principles throughout products, 
services and broader operations. 


Team diversity 

Diversity in the teams that build and train AI systems, including 
foundation models, helps ensure that a variety of perspectives 
and experiences are considered. This diversity improves the 
accuracy and performance of AI systems and helps reduce risks 
throughout the AI lifecycle, including the potential for adverse 
outcomes that impact groups that may not be well represented 
on less diverse teams. 
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AI policies, regulation 
and best practices 


A Policymaker’s Guide to Foundation Models introduces what 
policymakers need to know about foundation models. This blog, 
from the IBM Policy Lab, aims to help policymakers in the complex 
task of regulating the use of generative AI, aiming to avoid the 
risks without limiting innovation and beneficial opportunities. For 
additional information on IBM’s recommendations to policymakers, 
read IBM Chief Privacy and Trust Officer Christina Montgomery’s 
testimony before the U.S. Senate Judiciary Subcommittee on 
Privacy, Technology and the Law here. 


IBM is making an impact in shaping regulatory policy, industry 
best practices and tools, governance of emerging technologies, 
and sociotechnical research by leading and contributing to 
initiatives with organizations, such as: 


— The World Economic Forum 

— Partnership on AI 

— The International Association of Privacy 
Professionals (IAPP) AI Governance Center 

— The IEEE Global Initiative on Ethics of Autonomous 
and Intelligent Systems 

— Christina Montgomery’s service on the National 
Artificial Intelligence Advisory Committee (NAIAC) 

— The United Nations Global Digital Compact 

— The Global Partnership on Artificial Intelligence (GPAI) 

— The Organisation for Economic Co-operation 
and Development (OECD) 

— The Data & Trust Alliance 
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IBM has strong academic partnerships like the MIT-IBM 

Watson AI Lab, where a community of scientists at MIT and 
IBM Research conducts AI research and works with global 
organizations to bridge algorithms to their impact on business 
and society. The Notre Dame-IBM Tech Ethics Lab was formed 
to address the many diverse ethical questions implicated by the 
development and use of advanced technologies, including AI, 
machine learning (ML) and quantum computing. The Stanford 
University Human-Centered Artificial Intelligence (HAI) research 
advances AI research, education, policy and practices. 
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Keep watching this space to learn 
more about the latest developments 
in foundation models and how IBM 
is working toward the responsible 
development and use of this and 
other technologies. 


~~ 
he. 
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